
Mansucript title:"Bank Concentration and Schumpeterian Growth: Theory and Evidence"
Authors: Boubacar Diallo and Wilfried Koch

*-----------------------------------------------------------------------
*-----------------------------------------------------------------------

The data file datadk.dta and the Stata do file datadk.do are zipped in the
file Data_DK_RESTAT.zip. 

To access the data and to execute the program files using the Arthur Lewbel (2012) method, STATA
ivreg2 version 14.2.1.15 or greater must be used

*-----------------------------------------------------------------------
*-----------------------------------------------------------------------
                           VARIABLES
*-----------------------------------------------------------------------
*-----------------------------------------------------------------------

The variables are as follows:

growthpc: The average per-capita GDP growth rate over the period 1985-2011.
proxpc: The proximity of country i to the world technology frontier (the United States), measured as the logarithm of the ratio of the
initial per-capita real GDP of country i to the initial per-capita real GDP of the USA in the period 1985.
findev: Private credit, calculated as the average of the credit provided by the banking sector, including all credit to various sectors 
on a gross basis, with the exception of credit given to the central government, which is net.
conc3: Bank concentration (Conc3), calculated as the average of the share of assets of the three largest banks in terms of total bank assets.
conc5: Bank concentration (Conc5), calculated as the average of the share of assets of the five largest banks in terms of total bank assets.
meancost_3: Bank costs using the net interest margin, which equals to the average of the accounting value of a bank's net interest revenue as 
a share of its total earning assets.
legoruk: Dummy variable legal origin for English legal system.
legorfr: Dummy variable legal origin for French legal system.
legorge: Dummy variable legal origin for German legal system.
cathos: Share of Catholics in country i.
muslims: Share of Muslims in country i.
protmgs: Share of Protestants in country i.
school: Human capital, measured as the average of total enrolment in secondary education, regardless of age, expressed as a percentage of the
population of official secondary education age.
M2: Money growth, calculated as the average annual growth rate in money and government consumption as a percentage of GDP.
govc: Government consumption, calculated as the average of all current government expenditures for purchases of goods and services (including the 
compensation of employees).
credrights: Average aggregate score of creditor rights.
corrup: Average aggregate score of corruption.
landlocked: Geography, measured by a dummy variable that takes the value of one if country i is landlocked and 0 if not.
proxlegoruk: Interaction between the proximity of country i to the world technology frontier and the dummy variable English legal origin.
proxlegorfr: Interaction between the proximity of country i to the world technology frontier and the dummy variable French legal origin.
proxlegorge: Interaction between the proximity of country i to the world technology frontier and the dummy variable German legal origin.
proxcatholic: Interaction between the proximity of country i to the world technology frontier and the variable cathos.
proxmuslim: Interaction between the proximity of country i to the world technology frontier and the variable muslims.
proxprotest: Interaction between the proximity of country i to the world technology frontier and the variable protmgs.
interpcconc3: Interaction between the proximity of country i to the world technology frontier and bank concentration (conc3). 
interpcfindev: Interaction between the proximity of country i to the world technology frontier and financial development (findev).
interpcconc5: Interaction between the proximity of country i to the world technology frontier and bank concentration (conc5) .
interpchk: Interaction between the proximity of country i to the world technology frontier and school.
interpcland: Interaction between the proximity of country i to the world technology frontier and the dummy variable landlocked.
interpccredrights: Interaction between the proximity of country i to the world technology frontier and creditor rights (credrights).
interpccorrupt: Interaction between the proximity of country i to the world technology frontier and corruption (corrup).
interpcgovc: Interaction between the proximity of country i to the world technology frontier and government consumption (govc).
interpcM2: Interaction between the proximity of country i to the world technology frontier and money growth (M2).

*-----------------------------------------------------------------------
*-----------------------------------------------------------------------
                            SOURCES
*-----------------------------------------------------------------------
*-----------------------------------------------------------------------

The measure of real per-capita GDP comes from the new Penn World Table 8.1, publicly available at:
http://www.rug.nl/research/ggdc/data/penn-world-table.
The measures of private credit, bank concentration and bank costs are from the 2013 Global Financial Development Data Database (GFDD) of the 
World Bank Group. The datasets are publicly available at:
http://data.worldbank.org/data-catalog/global-financial-development
Legal origin dummies, corruption and creditor rights come from Laporta et al. (2008), and their dataset is publicly available at:
http://mba.tuck.dartmouth.edu/pages/faculty/rafael.laporta/publications.html
The datasets on religion come from Laporta et al. (1999) and Beck et al. (2006).
The datasets on school, money growth and government consumption come from the World Development Indicators of the World Bank Group. The datasets
are publicly available at:
http://www.worldbank.org/
The dataset on geography, namely the dummy variable landlocked comes from CEPII and is publicly available at:
http://www.cepii.fr/

*-----------------------------------------------------------------------
*-----------------------------------------------------------------------
                             TABLES
*-----------------------------------------------------------------------
*-----------------------------------------------------------------------

Using the do.file "datadk.do" you will be able to generate Tables 1-6 of the paper.






